363 research outputs found

    An intuitive Python interface for Bioconductor libraries demonstrates the utility of language translators

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Computer languages can be domain-related, and in the case of multidisciplinary projects, knowledge of several languages will be needed in order to quickly implements ideas. Moreover, each computer language has relative strong points, making some languages better suited than others for a given task to be implemented. The Bioconductor project, based on the R language, has become a reference for the numerical processing and statistical analysis of data coming from high-throughput biological assays, providing a rich selection of methods and algorithms to the research community. At the same time, Python has matured as a rich and reliable language for the agile development of prototypes or final implementations, as well as for handling large data sets.</p> <p>Results</p> <p>The data structures and functions from Bioconductor can be exposed to Python as a regular library. This allows a fully transparent and native use of Bioconductor from Python, without one having to know the R language and with only a small community of <it>translators</it> required to know both. To demonstrate this, we have implemented such Python representations for key infrastructure packages in Bioconductor, letting a Python programmer handle annotation data, microarray data, and next-generation sequencing data.</p> <p>Conclusions</p> <p>Bioconductor is now not solely reserved to R users. Building a Python application using Bioconductor functionality can be done just like if Bioconductor was a Python package. Moreover, similar principles can be applied to other languages and libraries. Our Python package is available at: <url>http://pypi.python.org/pypi/rpy2-bioconductor-extensions/</url></p

    The genetic organisation of prokaryotic two-component system signalling pathways

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Two-component systems (TCSs) are modular and diverse signalling pathways, involving a stimulus-responsive transfer of phosphoryl groups from transmitter to partner receiver domains. TCS gene and domain organisation are both potentially informative regarding biological function, interaction partnerships and molecular mechanisms. However, there is currently little understanding of the relationships between domain architecture, gene organisation and TCS pathway structure.</p> <p>Results</p> <p>Here we classify the gene and domain organisation of TCS gene loci from 1405 prokaryotic replicons (>40,000 TCS proteins). We find that 200 bp is the most appropriate distance cut-off for defining whether two TCS genes are functionally linked. More than 90% of all TCS gene loci encode just one or two transmitter and/or receiver domains, however numerous other geometries exist, often with large numbers of encoded TCS domains. Such information provides insights into the distribution of TCS domains between genes, and within genes. As expected, the organisation of TCS genes and domains is affected by phylogeny, and plasmid-encoded TCS exhibit differences in organisation from their chromosomally-encoded counterparts.</p> <p>Conclusions</p> <p>We provide here an overview of the genomic and genetic organisation of TCS domains, as a resource for further research. We also propose novel metrics that build upon TCS gene/domain organisation data and allow comparisons between genomic complements of TCSs. In particular, '<it>percentage orphaned TCS genes</it>' (or 'Dissemination') and '<it>percentage of complex loci</it>' (or 'Sophistication') appear to be useful discriminators, and to reflect mechanistic aspects of TCS organisation not captured by existing metrics.</p

    phyloXML: XML for evolutionary biology and comparative genomics

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Evolutionary trees are central to a wide range of biological studies. In many of these studies, tree nodes and branches need to be associated (or annotated) with various attributes. For example, in studies concerned with organismal relationships, tree nodes are associated with taxonomic names, whereas tree branches have lengths and oftentimes support values. Gene trees used in comparative genomics or phylogenomics are usually annotated with taxonomic information, genome-related data, such as gene names and functional annotations, as well as events such as gene duplications, speciations, or exon shufflings, combined with information related to the evolutionary tree itself. The data standards currently used for evolutionary trees have limited capacities to incorporate such annotations of different data types.</p> <p>Results</p> <p>We developed a XML language, named phyloXML, for describing evolutionary trees, as well as various associated data items. PhyloXML provides elements for commonly used items, such as branch lengths, support values, taxonomic names, and gene names and identifiers. By using "property" elements, phyloXML can be adapted to novel and unforeseen use cases. We also developed various software tools for reading, writing, conversion, and visualization of phyloXML formatted data.</p> <p>Conclusion</p> <p>PhyloXML is an XML language defined by a complete schema in XSD that allows storing and exchanging the structures of evolutionary trees as well as associated data. More information about phyloXML itself, the XSD schema, as well as tools implementing and supporting phyloXML, is available at <url>http://www.phyloxml.org</url>.</p

    Pharmacological levels of withaferin A (Withania somnifera) trigger clinically relevant anticancer effects specific to triple negative breast cancer cells

    Get PDF
    Withaferin A (WA) isolated from Withania somnifera (Ashwagandha) has recently become an attractive phytochemical under investigation in various preclinical studies for treatment of different cancer types. In the present study, a comparative pathway-based transcriptome analysis was applied in epithelial-like MCF-7 and triple negative mesenchymal MDA-MB-231 breast cancer cells exposed to different concentrations of WA which can be detected systemically in in vivo experiments. Whereas WA treatment demonstrated attenuation of multiple cancer hallmarks, the withanolide analogue Withanone (WN) did not exert any of the described effects at comparable concentrations. Pathway enrichment analysis revealed that WA targets specific cancer processes related to cell death, cell cycle and proliferation, which could be functionally validated by flow cytometry and real-time cell proliferation assays. WA also strongly decreased MDA-MB-231 invasion as determined by single-cell collagen invasion assay. This was further supported by decreased gene expression of extracellular matrix-degrading proteases (uPA, PLAT, ADAM8), cell adhesion molecules (integrins, laminins), pro-inflammatory mediators of the metastasis-promoting tumor microenvironment (TNFSF12, IL6, ANGPTL2, CSF1R) and concomitant increased expression of the validated breast cancer metastasis suppressor gene (BRMS1). In line with the transcriptional changes, nanomolar concentrations of WA significantly decreased protein levels and corresponding activity of uPA in MDA-MB-231 cell supernatant, further supporting its anti-metastatic properties. Finally, hierarchical clustering analysis of 84 chromatin writer-reader-eraser enzymes revealed that WA treatment of invasive mesenchymal MDA-MB-231 cells reprogrammed their transcription levels more similarly towards the pattern observed in non-invasive MCF-7 cells. In conclusion, taking into account that sub-cytotoxic concentrations of WA target multiple metastatic effectors in therapy-resistant triple negative breast cancer, WA-based therapeutic strategies targeting the uPA pathway hold promise for further (pre)clinical development to defeat aggressive metastatic breast cancer

    ANDES: Statistical tools for the ANalyses of DEep Sequencing

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The advancements in DNA sequencing technologies have allowed researchers to progress from the analyses of a single organism towards the deep sequencing of a sample of organisms. With sufficient sequencing depth, it is now possible to detect subtle variations between members of the same species, or between mixed species with shared biomarkers, such as the 16S rRNA gene. However, traditional sequencing analyses of samples from largely homogeneous populations are often still based on multiple sequence alignments (MSA), where each sequence is placed along a separate row and similarities between aligned bases can be followed down each column. While this visual format is intuitive for a small set of aligned sequences, the representation quickly becomes cumbersome as sequencing depths cover loci hundreds or thousands of reads deep.</p> <p>Findings</p> <p>We have developed ANDES, a software library and a suite of applications, written in Perl and R, for the statistical ANalyses of DEep Sequencing. The fundamental data structure underlying ANDES is the position profile, which contains the nucleotide distributions for each genomic position resultant from a multiple sequence alignment (MSA). Tools include the root mean square deviation (RMSD) plot, which allows for the visual comparison of multiple samples on a position-by-position basis, and the computation of base conversion frequencies (transition/transversion rates), variation (Shannon entropy), inter-sample clustering and visualization (dendrogram and multidimensional scaling (MDS) plot), threshold-driven consensus sequence generation and polymorphism detection, and the estimation of empirically determined sequencing quality values.</p> <p>Conclusions</p> <p>As new sequencing technologies evolve, deep sequencing will become increasingly cost-efficient and the inter and intra-sample comparisons of largely homogeneous sequences will become more common. We have provided a software package and demonstrated its application on various empirically-derived datasets. Investigators may download the software from Sourceforge at <url>https://sourceforge.net/projects/andestools</url>.</p

    Nurse led, primary care based antiretroviral treatment versus hospital care: a controlled prospective study in Swaziland

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Antiretroviral treatment services delivered in hospital settings in Africa increasingly lack capacity to meet demand and are difficult to access by patients. We evaluate the effectiveness of nurse led primary care based antiretroviral treatment by comparison with usual hospital care in a typical rural sub Saharan African setting.</p> <p>Methods</p> <p>We undertook a prospective, controlled evaluation of planned service change in Lubombo, Swaziland. Clinically stable adults with a CD4 count > 100 and on antiretroviral treatment for at least four weeks at the district hospital were assigned to either nurse led primary care based antiretroviral treatment care or usual hospital care. Assignment depended on the location of the nearest primary care clinic. The main outcome measures were clinic attendance and patient experience.</p> <p>Results</p> <p>Those receiving primary care based treatment were less likely to miss an appointment compared with those continuing to receive hospital care (RR 0·37, <it>p </it>< 0·0001). Average travel cost was half that of those receiving hospital care (<it>p </it>= 0·001). Those receiving primary care based, nurse led care were more likely to be satisfied in the ability of staff to manage their condition (RR 1·23, <it>p </it>= 0·003). There was no significant difference in loss to follow-up or other health related outcomes in modified intention to treat analysis. Multilevel, multivariable regression identified little inter-cluster variation.</p> <p>Conclusions</p> <p>Clinic attendance and patient experience are better with nurse led primary care based antiretroviral treatment care than with hospital care; health related outcomes appear equally good. This evidence supports efforts of the WHO to scale-up universal access to antiretroviral treatment in sub Saharan Africa.</p
    corecore